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ISOLATED NUCLEIC ACID MOLECULES WHICH ENCODE T CELL INDUCIBLE 
FACTORS (TIFs), THE PROTEINS ENCODED, AND USES THEREOF 



FIELD OF THE INVENTION 

This invention relates to newly isolated nucleic acid molecules and their uses. The 
nucleic acid molecules are shown to be upregulated by the cytokine interleukin-9 ("IL-9"). 
Also disclosed are the proteins encoded thereby. They are described as T Cell Derived 
Inducible Factors ( " TIFs " ) . These nucleic acid molecules encode proteins which induce STAT 
activation in cells. They can be used, for example, in the stimulation of regeneration of 
targeted tissues . Further, their inhibitors or antagonists can be used to retard, prevent or 
inhibit differentiation of other tissues. 

BACKGROUND AND PRIOR ART 

The last decade has seen knowledge of the immune system and its regulation expand 
tremendously. One area of particular interest has been that of research on the proteins and 
glycoproteins which regulate the immune system. One of the best known families of these 
molecules are the cytokines. These are molecules which are involved in the "communication" 
of cells with each other. The individual members of the cytokine family have been found to 
be involved in a wide variety of pathological conditions, such as cancer and allergies. 
Whereas sometimes the cytokines are involved in the pathology of the condition, they are also 
known as being therapeutically useful. 

Interleukins are one type of cytokine. The literature on interleukins is vast. An 
exemplary, but by no means exhaustive listing of the patents in this area includes U.S. Patent 
No. 4,778,879 to Mertelsmann et al.; U.S. Patent No. 4,490,289 to Stern; U.S. Patent No. 
4,518,584 to Market al.; and U.S. Patent No. 4,851,512 to Miyaji et al., all of which involve 
interleukin-2 or "IL-2. " Additional patents have issued which relate to interleukin-1 ("IL-1 "), 
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such as U.S. Patent No. 4,808,611 to Cosman. The disclosure of all of these patents are 
incorporated by reference herein. More recent patents on different interleukins include U.S. 
Patent Nos. 5,694,234 (IL-13); 5,650,492 (IL-12); 5,700,664, 5,371,193 and 5,215,895 (IL- 
11); 5,728,377, 5,710,251, 5,328,989 (IL-10); 5,580,753, 5,587,302, 5,157,112, 5,208,218 
(IL-9); 5,194,375, 4,965,195 (IL-7); 5,723,120, 5,178,856 (IL-6), and 5,017,691 (IL-4). 
Even a cursory review of this patent literature shows the diversity of the properties of the 
members of the interleukin family. One can assume that the larger cytokine family shows 
even more diversity. See, e.g. , Aggarwal et al. , ed. , Human Cytokines: Handbook For Basic 
And Clinical Research (Blackwell Scientific Publications, 1992), Paul, ed., Fundamental 
Immunology (Raven Press, 1993), pg 763-836, "T-Cell Derived Cytokines And Their 
Receptors", and "Proinflammatory Cytokines and Immunity." All cited references are 
incorporated by reference. 

The relationships between various cytokines are complex. As will be seen from the 
references cited herein, as the level of a particular cytokine increases or decreases, this can 
affect the levels of other molecules produced by a subject, either directly or indirectly. 
Among the affected molecules are other cytokines. 

The lymphokine IL-9, previously referred to as "P40," is a T-cell derived molecule 
which was originally identified as a factor which sustained permanent antigen independent 
growth of T4 cell lines. See, e.g. , Uyttenhove et al. , Proc. Natl. Acad. Sci. 85: 6934 (1988), 
and Van Snick et al. , J. Exp. Med. 169: 363 (1989), the disclosures of which are incorporated 
by reference, as is that of Simpson et al., Eur. J. Biochem. 183: 715 (1989). 

The activity of IL-9 was at first observed on restricted T4 cell lines, failing to show 
activity on CTLs or freshly isolated T cells. See, e.g., Uvttenhove etal .. supra , and Schmitt 
et al., Eur. J. Immunol. 19: 2167 (1989). This range of activity was expanded when 
experiments showed that IL-9 and the molecule referred to as T cell growth Factor III ("TCGF 
III") are identical to MEA (Mast Cell Growth Enhancing Activity), a factor which potentiates 
the proliferative response of bone marrow derived mast cells to IL-3, as is described by 
Hultner et al., Eur. J. Immunol, and in U.S. patent application Serial Number 498,182 filed 
March 23, 1990, the disclosures of both being incorporated by reference herein. It was also 
found that the human form of IL-9 stimulates proliferation of megakaryoblastic leukemia. See 
Yang et al., Blood 74: 1880 (1989). Recent work on IL-9 has shown that it also supports 
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erythroid colony formation (Donahue et al., Blood 75(12): 2271-2275 (6-15-90)); promotes 
the proliferation of myeloid erythroid burst formation (Williams et al. , Blood 76: 306-31 1 (9- 
1-90); and supports clonal maturation of BFU-E's of adult and fetal origin (Holbrook et al., 
Blood 77(10): 2129-2134 (5-15-91)). Expression of IL-9 has also been implicated in 
5 Hodgkins's disease and large cell anaplastic lymphoma (Merz et al., Blood 78(8): 131 1-1317 
(9-1-90). Genetic analyses of mice that were susceptible or resistant to the development of 
bronchial hyperresponsiveness have unraveled a linkage with the IL-9 gene as well as a 
correlation between IL-9 production and susceptibility in this model ( Nicolaides et al.. Proc. 
Natl. Acad. Sci. USA, 94, 13175-13180, 1997). Human genetic studies also point to the IL-9 

10 and IL-9R genes as candidates for asthma ( Doull etal.. Am. J. Respir. Crit. Care Med., 153 . 

1280-1284, 1996; Holrovd et al. . Genomics 52, 233-235, 1998). Secondly, IL-9 transgenic 
mice allowed for the demonstration that increased IL-9 expression result in lung mastocytosis, 
hypereosinophilia, bronchial hyperresponsiveness and high levels of IgE ( Temann et al., J. 
Exp. Med. 188, 1307-1320, 1998; Godfraind et al. . J. Immunol. 160, 3989-3996, 1998; 

15 McLane et al. . Am. J. Resp. Cell. Mol. 19:713-720 (1999). Taken together, these 
observations strongly suggest that IL-9 plays a major role in this disease Additional work has 
implicated IL-9 and muteins of this cytokine in asthma and allergies. See, e.g. PCT 
Application US96/12757 (Levitt, et al), and PCT US97/21992 (Levitt, et al), both of which 
are incorporated by reference.. 

20 IL-9 is known to affect the levels of other molecules in subjects. See Louahedet al. . 

J. Immunol. 154: 5061-5070 (1995; Demoulin etal. . Mol. Cell. Biol. 16: 4710-4716 (1996), 
both incorporated by reference. It will be recognized that the molecules affected have their 
own functions in biological systems. For example, Demoulin et al. show that many of the 
known activities of IL-9 are mediated by activation of STAT transcription factors. As such, 

25 there is continued interest in trying to identify molecules whose presence and/or level is 
affected by other molecules, such as cytokines. 

The disclosure which follows describes such molecules. It was found that nucleic acid 
molecules encoding the proteins of the invention were expressed in the presence of IL-9, but 
not in its absence. Hence, these molecules are, inter alia, "markers" for the expression or 

30 effect of IL-9 in a subject. The molecules are referred to as T Cell Derived Inducible Factors 
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or "TIFs" hereafter. These and other features of the invention will be seen in the disclosure 
which follows. 

BRIEF DESCRIPTION OF THE FIGURE 

Figure 1 compares deduced amino acid sequences of murine and human TIF (SEQ ED 
5 NOS: 27 and 28, respectively). 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Example 1 

The murine lymphoma cell line BW5147 is well known as a cell line which can be 
grown in vitro , without the need to add any cytokines to its culture medium. In order to 

10 identify genes induced by IL-9, samples of BW5147 were cultured either with (200 U/ml), or 
without IL-9, for 24 hours. Then, total RNA was isolated, using guanidium isothiocyanate 
lysis, and CsCl gradient centrifugation. These techniques are well known in the art. 
Following this, polyadenylated RNA was purified from the total RNA, by using an oligo(dT) 
cellulose column. The isolated, polyA RNA was then used to generate double stranded 

15 cDNA. A commercially available oligo(dT) primer was used. Anywhere from 3-5 ug of 
polyA RNA were heated to 70°C for 10 minutes with 1/ig of oligo dT, and then incubated 
with 5x first strand buffer (250 raM HC1 (pH 8.3), 375 mM KC1, 15 mM MgCl^, 10 mM 
dithiothreitol, 500 uM of deoxy nucleotide triphosphates, and 800 U of reverse transcriptase. 
Total volume of the reaction mixture was 20 ul, and the reaction was allowed to proceed at 

20 37°C for one hour. This resulted in synthesis of the first stand of cDNA. Second strand 
synthesis was accomplished by adding 30ul of 5 second strand buffer (lOOmM Tris-HCl (pH 
6.9)), 450mM KC1, 23mM MgCl 2 , 0.75mM 0-NAD+, 50mM (NH 4 ) 2 S0 4 , together with 60U 
of E. coli derived DNA polymerase I, 2U of E. coli RNase H, 10 U of E. coli DNA ligase, 
and 250 uM of deoxynucleotide triphosphates, and brought to a final volume of 150 ul. The 

25 mixture was incubated for two hours, at 16°C. 

The product was extracted using phenol-chloroform, and was precipitated with ethanol. 
The final cDNA product was then resuspended in 200 /ul of TE. 

These steps were carried out for both the stimulated BW5 147 cells ("tester" hereafter), 
and for parallel, unstimulated BW5147 cells ("driver" hereafter). 

4 
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Example 2 

The cDNA prepared in Example 1 was then subjected to subtraction cloning in 
accordance with well known methods. To do this, six oligonucleotides were prepared: 
5'-AGCACTCTCC AGCCTCTCAC CGCA-3 (SEQ ID NO: 1); 

5 ' -GATCTGCGGT GA-3' (SEQ ID NO: 2); 

5'-ACCGACGTCG ACTATCCATG AACA-3' (SEQ ID NO: 3); 

5 ! -GATCTGTTCA TG-3 1 (SEQ ID NO: 4); 

5 ' - AGGC A ACTGT GCTATCCGAG GGAA-3' (SEQ ID NO: 5); and 

5'-GATCTTCCCT CG-3' (SEQ ID NO: 6). 

These were used as explained herein. Double stranded cDNA (2 ug), was digested with 
restriction endonuclease DpnII, extracted with phenol-chloroform, precipitated with ethanol, 
and resuspended in 20 ul of TE (lOmM Tris-HCl (pH 7.5); ImM EDTA). Twelve ul (1.2 
ug), of cut cDNA was ligated to double stranded SEQ ID NOS: 1 and 2, in a mixture which 
included 4 ul of desalted SEQ ID NO: 1 (2mg/ml), 4 ul desalted SEQ ID NO: 2 (1 mg/ml), 
lO/d of 5X adapter buffer (330mM Tris-HCl, pH 7.6, 50mM MgCl 2 , 5mM ATP), 7pl DTT 
(lOOmM), and 28/xl of H 2 0). The oligonucleotides were annealed to each other and to the 
sample DNA by heating the mixture to 50°C and then cooling it to 10°C over one hour, 
followed by adding 5ul of T4 DNA ligase, and incubation for 12-14 hours, at 12-16°C. The 
mixtures were diluted by adding 140 ul of TE. PCR was then carried out on 200 ul samples, 
as described infra. 

Example 3 

To carry out PCR, 200 ul samples containing 2 ul of the ligation product in a buffer 
of 66 mM Tris-HCl, pH 8.8, 4 mM MgCl 2 , 16 mM (NH 4 ) 2 S0 4 , 33 ug/ml BSA, 0.3 mM of 
each dNTP (concentration: 500 pM)-, and 2 ug of SEQ ID NO: 2 were first heated at 72°C 
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for three minutes to remove any of SEQ ID NO: 1 which was hybridized to the product of 
Example 2. The 3* ends were then filled in by using 5U of Taq polymerase (5 minutes, . 
72°C). Twenty cycles of amplification were carried out (1 cycle:l minute at 95°C, and three 
minutes at 72°C), after which products were combined, phenol extracted, ethanol precipitated, 
and resuspended in TE buffer, at a concentration of 0.5 ug/ul. Hereinafter, this is referred 
to as the representation. 

Example 4 

The representation was then prepared for subtractive hybridization by removing SEQ 
ID NO: 1 therefrom by digestion with Dpn II. The resulting digest was phenol extracted and 
ethanol precipitated. In the case of the unstimulated sample, this resulted in the driver, while 
the stimulated sample resulted in the tester. Portions of tester (20 ug) were gel purified on 
a 1.2% agarose gel and isolated. Samples (2 ug), were ligated to SEQ ID NOS: 3 and 4, in 
the same way that SEQ ID NOS: 1 and 2 were ligated, as described, supra. 

In a first cycle of subtractive hybridization, 0.4 ug samples of tester with SEQ ID 
NOS: 3 and 4 ligated thereto were mixed with 40 ug of driver cDNA. The mixture was 
phenol extracted, ethanol precipitated, dissolved in 2 ul of 3XEE buffer (30mM EPPS pH 
8.0), 3mM EDTA; pH 8.0, 3 mM EDTA. This was overlaid with 30 ul of mineral oil, and 
denatured for five minutes at 98°C. A 5M NaCl solution (0.5 ul) was added, and DNA was 
hybridized for 20 hours, at 67 °C. The reaction mixture was diluted to 200 ul with TE, and 
tRNA carrier. The samples were incubated for three minutes at 72°C to melt away SEQ ID 
NO: 4, and then four PCR reactions (200 ul) were prepared. These included 20 ul of diluted 
hybridization mix without primer, to fill in the ends of the reannealed tester, followed by 10 
cycles of amplification after adding samples of SEQ ID NO: 3 (1 cycle: lminute at 95°C, three 
minutes at 70°C) after which products were combined, phenol extracted, ethanol precipitated, 
and resuspended in 40 p\ of 0.2XTE buffer. Single stranded DNA was degraded by a 30 
minute treatment of 20 /xl of this material with 20U of mung bean nuclease, at a total volume 
of 40 ul. Samples was diluted (1 :5), in 50 mM Tris-HCl, at pH 8.9, followed by five minutes 
of heating at 98 °C to inactivate the enzyme. A second PCR was carried out, using 20 ul of 
the product described supra , 2 ul of SEQ ID NO: 3 (1 mg/ml), and 1 ul (5 U) of Taq DNA 
polymerase. A total of 18 cycles (1 cycle: 1 minute at 95 °C, three minutes at 70°C) were 
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carried out. Products were combined, phenol extracted, ethanol precipitated, and resuspended 
at 0.5-1 ug//xl. The product is referred to hereafter as "DPI or the first difference product. 

Example 5 

DPI was then digested with endonuclease DpnII, as described above, and was ligated 
5 to SEQ ID NOS: 5 and 6, following the same processes described for SEQ ID NOS. 1,2,3 
and 4. Subtractive hybridization and selective amplification, as described in example 4, was 
repeated, and second difference product, or "DP2\ was generated. In these experiments, 50 
ng of DPI was the tester. The driver (40 ug), was as described supra . The process was 
repeated to generate a third difference product, using SEQ ID NOS: 3 and 4 as adapters. To 
10 generate the third product, 100 pg of tester were mixed with 40 jxg of driver. All steps of the 
protocols supra were repeated, except the final amplification was carried out for 22 cycles, 
where one cycle was one minute at 95°C, and three minutes at 70°C. This yielded the final 
difference product. 

Example 6 

15 The final difference products were digested with DpnII, and then cloned into the 

BamHI site of a commercially available vector, i.e. , ptZ19R. Double stranded DNA plasmids 
were prepared, and then sequenced, using standard methods. The sequences were compared 
to known sequences in the GenBank and EMBL data bases, using a BLAST search program. 
At the end of this subtraction procedure, a short cDNA fragment was identified, i.e., 

20 a fragment about 200 base pairs long. This fragment was used to screen a cDNA library from 
BW 5147 cells. The largest clone was sequenced. It is discussed infra. It does not 
correspond to any known sequence. 

The nucelotide sequence (SEQ ID NO: 7), is 1121 bases long, including a 537 base 
pair open reading frame, which encodes a protein 179 amino acids long. The predicted 

25 molecular weight of the protein is 20,093. There are two additional ATG codons which, if 
they acted as start codons, would produce proteins 172 and 167 amino acids in length, with 
molecular weights of 19,335 and 18,770 daltons, respectively. Each form of the protein is 
characterized by a sequence of hydrophobic amino acids which would be cleaved off of the 
molecule via the endoplasmic reticulum to provide a mature protein. 

7 
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Analysis of the sequence shows three AT rich motifs (TTATTTAT). These motifs are 
often found in 5' - untranslated regions of cytokines and oncogenes. Kruys. et al. . Science. 
245 : 852 (1989), have shown that these repeats modulate stability of mRNA for TIF. 

Example 7 

The cDNA isolated and analyzed in example 6, supra , was then used as a probe to 
identify genomic DNA for TIFa. 

A genomic library prepared from mouse strain 129 was screened with SEQ ID NO: 7, 
following standard methods. An EcoRI fragment from a positive clone was subcloned into 
plasmid pZERO and partially sequenced. The partial sequence is presented as SEQ ID NO: 

8. 

Example 8 

A second EcoRI fragment from the positive clone described in Example 7, supra , was 
also subcloned. There was a great deal of homology, but the sequences were not identical. 
To be specific, intron 1 of this sequence was 98% identical to SEQ ID NO: 8, intron 2 was 
100% identical and intron 3 was 92% identical. 

What is striking about the sequences is that the promoters are not at all homologous, 
suggesting independent regulation. The 5' untranslated regions are 92% identical. The first 
exon for TIFa is split into exon la and exon ip. The first coding exon (which is exon lb for 
TIFa and exon 1 for TIFP) are 99.5% identical, while the second exons are 100% identical, 
the third exons 97% identical, the fourth exons 98.5% identical, and 96% for the fifth exon. 
In the untranslated 3' - region, homology is 96%. 

Example 9 

Using the information described in example 8, supra , a cDNA sequence for the second 
clone, designated TIFP was deduced, and is set forth as SEQ ID NO: 9. The genomic DNA 
sequence was also ascertained, in the same manner as is described, supra , and is set forth as 
SEQ ID NO: 29. 

As compared to the coding region for TIFa, that of TIFp has six silent changes. There 
are two changes which result in an inconsequential amino acid change (at both of positions 36 

8 
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and 113, Val in TIFa becomes He in TIFp). There is also a more significant change, at 
position 1 12, where Gin becomes Arg. 

Example 10 

Experiments were undertaken to study expression of the TIFs. BW 5147 cells were 
stimulated with recombinant murine IL-9 (200U/ml), for varying periods of time (0.2, 0.5, 
1, 2 & 24 hours). Total RNA was then isolated, using standard methods and reagents. 
Reverse transcription was then carried out, using 5/xg total RNA and an oligo (dT) primer. 
Samples of cDNA corresponding to 20ng of total RNA were then amplified for 25 cycles 
using different primers. (One cycle was 4 minutes at 94 °C, 1 minute at 57°C, and 2 minutes 
at 72°C). The TIF primers were: 

S'-CTGCCTGCTT CTCATTGCCC T-3' (SEQ ID NO: 10) 
and 

5-CAAGTCTACC TCTGGTCTCA T-3' (SEQ ID NO: 11) 
(sense and antisense, respectively). 

These correspond to nucleotides 106-126, and 764-784 of SEQ ID NO: 7, respectively. 
As a control, P-actin was amplified as well, for 18 cycles (first cycle: 4 minutes at 94°C, 1 
minute at 60°C, 2 minutes at 72°C. Succeeding cycles were 1 minute at 94°C, 1 minute at 
60°C, 2 minutes at 72°C). 

Following amplification, post PCR products were analyzed on a 1 % agarose gel, and 
specific amplification was confirmed, following blotting, using internal radioactive probes. 
The probe for TIF was: 

5 ' -GACGC A AGC A TTTCTC AGAG-3 1 (SEQ ID NO: 12) 

the conditions and probes set forth were not specific for one or the other of the forms of TIF; 
however, the amplification product of TIFa contains a Kpnl restriction site, while the 
restriction site for TIFp does not. Digestion of the amplification products with Kpnl indicated 
that most, if not all, of the TIF mRNA induced by IL-9 was TIFa, suggesting that the TIFa 
expression was induced rapidly via the IL-9. The mRNA for TIFa was detectable after 30 
minutes of stimulation, and reached a plateau over a 1-24 hour time period. 

Example 11 
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Experiments were then carried out which showed that the induction of TIF mRNA by 
IL-9, described supra , does not require protein synthesis. In these experiments, total RNA 
was extracted from cells stimulated for 24 hours, as described in example 10, but with or 
without 10/ig/ml of a protein synthesis inhibitor, cycloheximide, for 4.5 hours. In a parallel 
set of experiments, cells were not stimulated. The total RNA was extracted, and RT-PCR 
amplification was carried out as described in example 10. Post-PCR products were analyzed 
on an ethidium bromide-stained, 1% agarose gel. What was seen was that the induction by 
IL-9 still occurred when protein synthesis was blocked. Hence, the effect of IL-9 is a direct 
effect, not requiring the synthesis of a protein mediator. 
Example 12 

In these experiments, the role of STAT proteins in induction of TIF mRNA was 
studied on derivatives of the cell line BW5147. The first line, BWh9R, expresses wild type 
human IL-9 receptors. The line BW-Phell6 is a transfectant with a single mutation (at 
position 116), which renders the receptor unable to activate STAT transcription factors. Still 
another cell line, BW-mut6, has a mutation which renders the receptor unable to activate 
STATS, while retaining the ability to activate STAT1 and STAT3. Finally, cell line BW-mut7 
has a single mutation which renders the IL-9 receptor unable to activate STAT1 and STAT3, 
but which retains the ability to activate STATS. 

Cell stimulation, isolation of total RNA, reverse transcription and amplification of 
cDNA were all carried out as described in example 10 (Cells were stimulated for 24 hours. 
Both human and murine recombinant IL-9 were used). The PCR products were analyzed on 
an ethidium bromide stained, 1 % agarose gel, as describe supra . 

The analysis revealed that human IL-9 did not induce expression in BW-Phell6, 
suggesting that STAT transcription factors are implicated. It was found that IL-9 induced TIF 
expression in the BW-mut6 mutant, but not the mut7 variant, suggesting that STAT1 or 
STAT3 are involved, but not STAT5. 

Example 13 

The expression of TIF mRNA in normal mouse spleen cells was then studied. 
Spleen cells from 10-12 week old Balb/c mice were cultured for 24 hours in control 
medium or the control medium supplemented with 20/zg/ml of LPS (which activates B 
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lymphocytes and macrophages), or ConA (which activates T cells), or ConA plus 1% of a 
blocking antiserum against murine IL-9, with p actin being used as a control. Purification of 
RNA, RT-PCR analysis were carried out as described supra . 

The data indicated that TIF is, at best, very weakly expressed in resting spleen cells, 
not induced by LPS, but strongly induced by ConA. Anti IL-9 antiserum did not affect 
induction by ConA, suggesting that its effect is not mediated by IL-9, or is mediated by other 
cytokines. 

When the ConA activated spleen cells were analyzed using sequences of RT-PCR 
products, it was found that these cells were expressing TIFcc predominantly, or exclusively. 

Example 14 

Further experiments showed that TIF mRNA was expressed even in the absence of IL- 
9 induction. 

Spleen cells from 5 week old FVB mice were enriched for T cells, using a nylon wool 
column. Then, the cells were stimulated for 24 hours in, medium supplemented with ConA 
(a T cell activator), or PMA (which activates PKC in most cells), either with or without IL-9. 

Total RNA was isolated using standard techniques, and then ten microgram samples 
were fractionated via electrophoresis on a 1.3% agarose gel containing 2.2M formaldehyde. 
The fractions were then transferred to a nitrocellulose membrane, labeled, and assayed in a 
hybridization assay following Van Snick, et al, J. Exp. Med. 169: 363 (1989), incorporated 
by reference. 

The results indicated that the induction of TIF by ConA was not modified, and that IL- 
9 did not induce TIF RNA in PMA activated spleen cells. 

Example 15 

The expression of TIF mRNA in various cell lines was tested. In these experiments, 
murine cell lines were stimulated for at least one day, with a particular cytokine. Specifically, 
9T7 is a T cell lymphoma, which responds to IL-2, IL-4 or IL-9. Cell lines TS3 and TS6 are 
derived from T helper cell clones, and proliferate in the presence of either IL-2 or IL-9. MC9 
and LI38 are mast cell lines, which proliferate in the presence of either IL-3 or IL-9. 
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Following stimulation, total RNA was prepared using standard guanidium 
isothiocyanate lyses, and CsCl gradient centrifugation. 

The 9T7 line was then analyzed by Northern blotting, as described in example 14, 
while the other lines were assayed using RT-PCR analysis, as described supra. 
5 It was found that IL-9 upregulated TIF expression in T helper cells and mast cells, 

while IL-2 and IL-3 did not. The 9T7 cell line, however, showed roughly the same level of 
expression, regardless of the cytokine, indicating that IL-9 is not mandatory for TIF 
expression. 

Example 16 

10 The expression of TIF mRNA in B cell lines was then studied. The cell lines A20, 

70Z/3, and BCL-1 are B cell leukemia cell lines which grow, in vitro , without cytokines. 
These cells were stimulated for 24 hours with IL-4 and IL-9 and total RNA was isolated, using 
standard methods. Expression was analyzed by RT-PCR which was carried out for 35 cycles, 
followed by blotting and hybridization, as described supra . 

15 The results indicated that TIF expression is detectable in B cells, but is weakly 

upregulated at best in the presence of IL-9 and IL-4. 

Example 17 

Experiments were then carried out to study expression of the inventive molecules in 
T helper cell lines. TS2 and TS1 are known T helper cell lines, derived from T helper cell 
20 clones, which proliferate in the presence of either IL-9 or IL-2 (TS2), and either IL-9 or IL-4 
(TS1). Specifically, TS1 or TS2 cells were grown in the presence of the listed cytokines for 
at least 10 days, after which RNA was extracted using known methods. Expression of the 
molecules was studied via RT-PCR (35 cycles), using the protocols described supra . In TS1 
cells both IL-4 and IL-9 induce TIF expression, but IL-2 does not do so in TS2 cells. 

25 Example 18 

Expression of TIF mRNA in various mouse organs were studied. Total RNA was 
prepared from liver, kidney, heart, brain, intestine, spleen, thymus, lung, muscle and bone 
marrow, using standard guanidium isothiocyanate methodologies and CsCl gradient 
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centrifugation. Forty cycles of RT-PCR were carried out, using the protocols described supra . 
Strongest expression was found in thymus tissue, while less intense signals were found in brain 
tissue, and weaker expression in the remaining tissues. 

Example 19 

The following experiments describe production of TIFa in 293-EBNA cells. 

Complementary DNA for TIFa was described supra . It was subcloned into a 
commercially available expression vector pCEP-4, in operable linkage with a CMV promoter. 
The resulting plasmids were transfected into 293-EBNA cells, using standard lipofectamine 
methods. Following transfection, the cells were incubated in a methionine free medium, 
supplemented with 35 S labeled methionine, for 24 hours. Supernatant was harvested, and run 
on an acrylamide gel, followed by electrophoresis. The gel was then dried and exposed to 
autoradiography for 1 day. A control was then run by transfecting cells with the same 
plasmid, in which the cDNA was cloned in the antisense direction. 

A heterogenous band of about 25-30 kilodaltons was found from the cells transfected 
with TIF in the sense direction. Any discrepancies between the predicted molecular weight, 
the actual molecular weight in the system, and the heterogeneity, can be attributed to 
glycosylation. In a series of parallel experiments, cDNA encoding human TIF was expressed 
in the same way as the murine cDNA was expressed. With the exception of the change of the 
cDNA, all experimental parameters were the same. 

Example 20 

Further experiments were carried out to study production of TIFa in COS cells. 
Specifically, TIFa cDNA was subcloned into the plasmid pEF-BOS.puro described by 
Demoulin et al . . supra , in operable linkage with the EF-la promoter. The plasmid cDNA was 
transfected into COS cells, using the same lipofectamine method described supra. The cells 
were incubated in methionine free medium, supplemented with 35 S methionine for 24 hours, 
after which supernatant was treated as described in example 20, supra . Again, a heterogenous 
band of 25—30 kilodaltons was observed, as well as an 18 kilodalton band, which probably 
represents a non-glycosylated form of the molecule. 
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Example 21 

In these experiments, it was discovered that TIF induces STAT activation in mesangial, 
neuronal melanoma, and hepatoma cells. It is known that when cytokines activate STAT 
factors, the factors dimerize, move from cytoplasm to the nucleus, and bind to target 
5 sequences in promoters. The details of the experiments follow. 

Transfected 293-EBNA cells as described supra were used following incubation in 
normal medium for 48 hours, as were supernatant from the controls, also described supra . 
Samples of a mouse kidney mesangial cell line, ("MES13 " hereafter), a rat pheochromocytoma 
cell line, ("PC12" hereafter), four different human melanomas (SK23, AUMA, NA-8mel and 

10 MULL), human heptaoma (HepG3) and rat hepatoma (H-4-II-K) were used. Cell samples 
(0.5xl0 6 ) were stimulated for 5-10 minutes in the presence of 1% of supernatant. Nuclear 
extracts were then prepared, in accordance with Demoulin et al. . Mol. Cell. Biol. 16: 4710 
(1996), incorporated by reference. In brief, cells were washed with PBS and then resuspended 
in 1 ml of ice cold hypotonic buffer for 15 minutes. (Buffer was lOmM HEPES buffer, pH 

15 7.5, with lOmM KC1, ImM MgCl 2 , 5% glycerol, 0.5 mM EDTA, O.lmM EGTA, 0.5mM 
dithiothreitol, and ImM Pefabloc, ImM Na 3 V 4 , and 5mM NaF). Cells were then lysed by 
adding 65 /xl of NP-40, followed by vortexing. Nuclei were pelleted, by vortexing for 30 
seconds at 14,000 rpm, followed by extraction in buffer supplemented with HEPES (20mM), 
glycerol (20%), and NaCl (420mM). Nuclear debris was removed by centrifuging for 2 

20 minutes. DNA binding activity was determined in accordance with Demoulin et al. . supra , 
using a 32 P labeled double stranded oligonucleotide called "GRR, " which contains the STAT 
binding site of the FcyRI gene promoter, i.e.: 
5'ATGTATTTCC CAGAAA-3' (SEQ ID NO: 13) 
and 

25 5 ' -CCTTTTCTGG GAAATAC-3' (SEQ ID NO: 14) 

corresponding to the upper and lower strands of the binding sites in the GRR probe. Briefly, 
5/il volume of nuclear extracts were incubated in binding buffer (12mM HEPES, pH 7.6, 
lOmM KC1, 0.5mM EDTA, 2.5% glycerol, O.lmg of poly(dl-dC) per ml) for 5 minutes. 
Radiolabeled GRR probe (lO^pm; approximately 0.5ng) was added, and incubation was 

30 continued for 25 minutes before loading onto a non-denaturing polyacrylamide gel. 
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It was also noted that the complexes observed in MES13 cells, described supra , were 
partially overshifted by both anti-STAT5 and anti-STAT3 antibodies, showing that (i) the cells 
under examination were targets for TIF, and (ii) that STAT3 and STATS are major 
components of the complex activated by TIF. The difference in STAT profile, as compared 
5 to the profile in Example 12, supra , is attributable to the difference in cell source (human 
versus mouse). It was also observed that human TIF works on murine cells, and vice versa. 

Example 22 

This example details the isolation and cloning of a nucleic acid molecule which encodes 
human TIF. First, human peripheral blood mononuclear cells were prepared via standard 

10 density gradient centrifugation. Following this preparation, samples were cultured for 24 
hours, at 3 x 10 6 cells/ml, either with or without anti-CD3 monoclonal antibody (The antibody 
was the commercially available OKT3 mAb, used in the form of ascites fluid at 1/500 
dilution). This antibody was used because T cell derived cytokines are generally expressed 
only upon activation by e.g., CD3 specific antibodies. 

15 Total RNA was isolated from these cells, using standard guanidine-isothiocyanate / 

CsCl ultra-centrifugation techniques. Following isolation, 10/ig samples of the RNA were 
reverse transcribed using an oligo (dT)15 primer. 

Following preparation of cDNA, as outlined supra , samples which corresponded to 
lOOng of total RNA were amplified, via PCR, using the following primers: 

20 5' - AGCTGCTCAA CTTCACCCTG GA - 3* (SEQ ID NO: 15) 
5' - CCACTCTCTC CAAGCTTTTT CA - 3* (SEQ ID NO: 16) 

which are based upon a murine cDNA sequence, (i.e., SEQ ID NO: 7). The PCR conditions 
involved 30 cycles of amplification, with one cycle defined as 1 minute at 94°C, followed by 
1 minute at 42 °C, and then 2 minutes at 72°C. Amplification product was separated on an 
25 agarose gel, using standard methods, and then sequenced. The result indicated that fragments 
of the cDNA had been amplified. Hence, a second reaction was carried but, using the same 
materials except SEQ ID NO: 16 was replaced by SEQ ID NO: 17, i.e.: 
5' - CAAGTCTACC TCTGGTCTCA T - 3' 

This second PCR reaction was carried out for 25 cycles, with one cycle being defined as 1 
30 minute at 94°C, followed by 1 minute at 45°C, and then 2 minutes at 72°C. The 
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amplification product was subjected to the same steps as the first one. Again, fragments of 
cDNA were amplified. 

Example 23 

Following preparation of amplification product, the 5' end of cDNA was isolated by 
using standard, 5' - RACE techniques. In brief, first strand cDNA was prepared by using 
SEQ ID NO: 18 as a primer, i.e.: 
5' - TGGCCAGGAA GGGCACCACC T - 3' 

This primer was based upon the sequence information obtained in accordance with example 
22. In brief, the 5* - RACE method was carried out by combining 1 //g of total RNA, 
prepared as described supra . 2.5 pmoles of SEQ ID NO: 18, reverse transcriptase , reverse 
transcriptase buffer, 2.5 //l of dNTP mix (10 mM), 2.5 yul of MgCl 2 (25mM), and 2.5 pi of 
dithiothreitol (0. 1 M). The reaction was carried out and, after completion, original RNA was 
removed via adding RnaseH, and Rnase TI. Any unincorporated dNTPs, as well as primer 
and proteins, were removed. The cDNA was tailed using terminal transferase, or TdT. n 
This enzyme creates a 3' -binding site for the abridged anchor primer, as described infra. 
Tailing was carried out by combining the purified, first strand cDNA, TdT, buffer (10 mM 
Tris-HCl, 25 mM KC1, L5 mM MgCl 2 ), and 200 fM of dCTP. 

Following the tailing reaction, PCR was carried out using 
5' - TGGCCAGGAA GGGCACCACC T - 3' (SEQ ID NO: 19), 
and 5' - RACE abridged anchor primer: 

5' - GGCCACGCGT CGACTAGTAC GGGIIGGGIIGGGHG - 3* (SEQ ID NO: 20). 
The amplification involved 35 cycles (1 cycle defined as 1 minute at 94°C, 1 minute at 56°C, 
and 2 minutes at 72 °C). Following this, nested amplification was performed on 5 ^1 of a 
1/100 dilution of the amplification product, using SEQ ID NO: 19 and the abridged universal 
amplification primer: 

5' - GGCCACGCGT CGACTAGTAC - 3' (SEQ ID NO: 21). 

Amplification involved 30 cycles (1 cycle being defined as 1 minute at 94°C, 1 minute at 
56 °C, and 2 minutes at 72 °C). The resulting PCR product was cloned, following standard 
procedures, and sequenced. 
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These three protocols, i.e., the two experiments described supra which generated 
fragments, and the 5' - RACE PCR, also described supra , permitted alignment of the 
sequenced amplification product, to generate the complete sequence. 

Following the alignment, oligonucleotides were generated which flanked the deduced 
5 open reading frame , i . e . : 

5' - CCTTCCCCAG TCACCAGTTG - 3' (SEQ ID NO: 22) 
and 

5' - TAATTGTTAT TCTTAGCAGG - 3 1 (SEQ ID NO: 23). 

These primers were used to amplify the entire open reading frame, using mRNA from CD3 
10 specific mAb stimulated cells, as described supra . For amplification, 25 cycles (1 cycle being 

defined as 1 minute at 94°C, 1 minute at 56°C, and 2 minutes at 72°C). 

The complete sequence of the human cDNA is set forth at SEQ ID NO: 24. 

As with the murine sequence, there are potential start codons at positions of SEQ ID 

NO: 24 which correspond to amino acids 1 and 13, as well as codons corresponding to 
15 methionine at amino acid positions 58, 85, and 92. The possible initiator codons correspond 

to proteins with calculated molecular weight of 19,998 daltons, and 18,735 daltons 

respectively (for 176 or 167 amino acids, respectively). As with the murine form of the 

protein, hydrophobic leader sequences are seen, indicating an N-terminal signal sequence of 

from about 20 to about 40 amino acids. 

20 Example 24 

These experiments detail work on the isolation of human genomic DNA corresponding 
to the cDNA discussed supra . 

Based upon the cDNA sequences, primers were developed which correspond to 
nucleotides 51-70 and the complement of nucleotides 631-650 of SEQ ID NO: 24. PCR was 

25 carried out, using standard methodologies. Specifically , lOOng of genomic DNA was used as 
a template, and 33 cycles of amplification were cararied out (one cycle of amplification being 
defined as 94° C for 30 seconds, 50° C for 30 seconds, and 72° C for 5 minutes). Once a 
sequence was isolated, it was sequenced, and this is set forth as SEQ ID NO: 25. The 
sequence is about 4.8 kilobases in length, and is believed to contain the entire genomic 
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sequence encoding the TIF molecule, lacking only the 5* flanking region, the promoter, and 
the 3' end. 

Example 25 

It was of interest to identify where the genomic DNA discussed supra was located in 
5 the human genome. In order to do this, two different approaches were taken. In the first, the 
sequence discussed supra, i.e. , SEQ ID NO: 25, was labeled with a flourescent label, and then 
was used to probe the human genome via fluorescent, in situ hybridization ("FISH") using 
standard methods. 

In a second approach, a panel of radioactive hybrid clones were screened using the 
10 probe consisting of nucleotides 51-70 of SEQ ID NO: 24, and 5 ' -ATCAGATGGA 
TTACTGA ATG-3 ? (SEQ ID NO:26). PCR was carried out using 25 ng of genomic DNA as 
a template, for 35 cycles, where one cycle is defined as 94 °C for in minute, 55° C for i 
minute and 72 °C for 2 minutes. 

Both methodologies indicated that the gene is located at chromosome 12ql5. Some 
15 work links diseases associated with asthma at this site. See, e.g. Nat. Genet. 15:389-392 
(1997); Ober, et al, Hum. Mol Genet. 7(9): 1393-1398(1998); Nickel, et al, Genomic 
46(1): 159-162(1997); Takahashi, et al, Genomics 44(1): 150-2(1997); Barnes, etal, Genomics 
37(1):41-50(1996), all incorporated by reference.. 

Example 26 

20 These experiments describe the manufacture of antibodies which bind to the TIF 

protein. To make these, a peptide consisting of amino acids 40-61 encoded by SEQ ID NO: 
7 was coupled to KLH carrier protein, using standard methods and a ratio of 1 mg peptide to 
1 mg carrier protein. Subject animals (rabbits), were immunized 3 times, at 2 week intervals, 
with 150 /ig of the complex. The immunogen was emulsified in Complete Freund's Adjuvant 

25 for the first injection, and then Incomplete Freund's Adjuvant for the next two. 

A first bleed was performed one month after the last injection, and serum was 
prepared, following known methods. 

The serum was then tested in a standard Western Blot. In brief, 10 /xl of supernatant 
from cells transfected with either SEQ ID NO: 7 or SEQ ID NO:24 were separated via SDS- 
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PAGE electrophoresis, and then blotted onto PVDF membranes. Antiserum was diluted to 
1 :500, and used in a standard Western Blot protocol, together with anti-rabbit antibody as the 
secondary antibody, and a commercially available detection kit. 

It was found that the serum did, in fact, recognize the TIF protein. 

In figure 1 , the deduced amino acid sequences of murine and human TIF are set out. 
The high degree of homology is seen in the boxed regions. 

The foregoing examples describe the invention, one aspect of which are isolated 
nucleic acid molecules, which encode TIF proteins such as those with the amino acid sequence 
of the protein encoded by the nucleotide sequence of SEQ ID NO: 7, 24 or 25. It will be 
appreciated by one of ordinary skill that the degeneracy of the genetic code facilitates the 
preparation of nucleic acid molecules which may not be identical to the nucleotide sequence 
of SEQ ID NO: 7, 24 or 25, but which encode the same protein. Of course, SEQ ID NOS: 
7, 24 and 25 are preferred embodiments of this invention, but other embodiments are also a 
part of the invention. Genomic DNA, complementary DNA, and RNA, such as messenger 
RNA, are all to be included therein. Isolated nucleic acid molecules from other animal 
species, including other mammals, are also a part of the invention. A preferred aspect of the 
invention are isolated nucleic acid molecules whose complements hybridize to SEQ ID NO: 
7, SEQ ID NO: 8, SEQ ID NO: 9, or SEQ ID NO: 24 under stringent conditions. "Stringent 
conditions, " as used herein, refer, for example, to hybridization at 65°C in buffer (3.5xSSC), 
0.02% Ficoll, 0.02% polyvinylpyrrolidone, 0.02% bovine serum albumin, 25mM NaH 2 P0 4 
(pH 7), 0. 1 % SDS, 2mM EDTA, followed by a final wash at 2xSSC, room temperature and 
then 0.1xSSC/0.2xSDS at temperatures as high as, e.g., about 65°C. More stringent 
conditions, such as 0. IxSSC, can also be used. These nucleic acid molecules encode proteins 
of about 17-22 kD as determined by SDS - PAGE, which activates STAT proteins, such as 
STAT 1,STAT3 and/or STAT5. In glycosylated form, these proteins can range from about 
17 to about 30 kilodaltons, as determined by SDS - PAGE. 

Also a part of the invention are expression vectors which include the nucleic acid 
molecules of the invention, operably linked to a promoter, so as to facilitate expression of the 
DNA. It is well within the skill of the artisan to prepare such vectors. 

The vectors, as well as the nucleic acid molecules per se, can be used to prepare 
recombinant cells, be these eukaryotic or prokaryotic, wherein either an expression vector or 
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the nucleic acid molecule itself is incorporated therein. E. coli cells, COS cells, CHO cells, 
etc., are all examples of types of cells which may be used in accordance with this aspect of the 
invention. 

Proteins encoded by the above referenced nucleic acid molecules, preferably in isolated 
5 form, are another feature of this invention. By "protein" is meant both the immediate product 
of expression of the nucleic acid molecules, glycosylated forms of it, as well as multimeric 
forms, such as dimers, trimers, and so forth. Also a part of the invention are multimers, such 
as dimers, which contain at least one protein molecule of the invention, and at least one, 
different protein molecule. Preferably, this different protein molecule is a cytokine, such as 

10 IL-10. Also included as a feature of the inventions are constructs, such as fusion proteins, 
where all or a part of the proteins described supra are linked in some fashion, such as in a 
fusion protein, to at least one addtional protein or peptide, or amino acid sequence. The 
"fusion partner" may be, for example, a molecule which provides a recognizable signal, either 
directly or indirectly, such as a FLAG peptide, p-galactosidase, luciferase, and so forth. 

15 These fusion partners are preferably joined to the molecule which is described supra at the N- 
and/or C- terminus of the protein; however, it is to be understood that there are many 
techniques known for joining molecules to amino acids, and any and all of these methodologies 
can produce constructs which are a part of the invention. 

The individual protein molecules of the invention, as noted supra , will preferably have 

20 a molecular weight of from about 17 to about 30 kilodaltons, as determined by SDS-PAGE. 

In multimeric forms, the molecular weight of the complex will, of course, vary, but the TIF 
molecules contained therein will each have a molecular weight of about 17 to 30 kilodaltons, 
as determined by SDS-PAGE. 

The proteins preferably consist of at least about 120 and no more than about 200 amino 

25 acids. Preferably, the amino acids sequences consists of or comprises all or part of the amino 
acid sequences encoded by SEQ ID NOS: 7,8, 9, 24 or 25. More preferably, the amino acid 
sequence contains all but about the first 40 amino acids encoded by said SEQ ID's. Even 
more preferably, it contains all but about the first 20 amino acids encoded by these sequences. 
Most preferably, the protein comprises amino acids set forth at SEQ ID NO: 27 or 28. 

30 It will be appreciated by the skilled artisan that the proteins encoded by the above 

recited nucleic acid molecules are a feature of the invention, and may be used to produce 
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antibodies, in accordance with standard protocols. Such antibodies, in monoclonal and 
polyclonal form, constitute a further feature of the invention as do fragments of said 
antibodies, chimeric forms, humanized forms, recombinant forms, and so forth. Also a 
feature of the invention are immunogens, comprising all or a part of the amino acid sequence 
5 protein molecules of the invention, preferably combined with an adjuvant, such as Complete 
or Incomplete Freund's Adjuvant. Portions of the protein sequences may be linked to other 
molecules, such as keyhole limpet hemocyanin, to render them more immunogenic. These 
antibodies can be used, e.g., to determine if the proteins of the invention are present. This 
is a further feature of the invention, as is now explained. It has been shown, in the examples, 

10 that the nucleic acid molecules of the invention were expressed in the presence of the IL-9. 
Hence, a further feature of the invention is a method to determine if IL-9 is or has been 
present, wherein one detects either the proteins of the invention, using antibodies for example, 
or mRNA using the nucleic acid molecules of the invention, as probes. The mRNA can be 
determined directly, or in the form of cDNA. Such probes may or may not be labeled, as a 

15 matter of choice for the user. Hence, one can determine, for example, if, following 
administration of IL-9, the cytokine is still efficacious, by determining if the nucleic acid 
molecule of the invention is present. This type of assay can be adapted, for quantitative 
studies, wherein one determines, for example, either if a cell is sensitive to IL-9, and if so, 
how sensitive it is. One can also use the proteins of the invention to phosphorylate STAT 

20 proteins such as STAT1 , STAT3 and/or STAT 5. This in turn results in dimerization of the 
STAT protein, followed by migration to the nucleus to provoke the effect that these STAT 
proteins have on cells. 

One could also use these molecules to test the efficacy of IL-9 agonists or antagonists 
when administered to a subject, such as a subject suffering from lymphoma , an immune 

25 system disorder such as an allergy, acquired immune deficiency syndrome, autoimmune 
diabetes, thyroiditis, or any of the other conditions described in , e.g, US Patent No. 
5,830,454; 5,824,551, and pending application Serial No. 08/925,348, filed on September 8, 
1997 now allowed, all of which are incorporated by reference. The molecules can also be 
used to mediate the role of IL-9 in these and other conditions. To elaborate, since IL-9 

30 induces TIFs, the TIFs are useful as IL-9 activity mediators. Thus, a further aspect of the 
invention is a method to determine activity of endogenous IL-9, such as in situations where 
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excess IL-9 activity is implicated, such as asthmas, allergies, and lymphomas. One can also 
block or inhibit IL-9 activity by blocking or inhibiting TIF or TIF activity, using, e.g., 
antisense molecules, antibodies which bind to TIF, or other antagonists of these molecules. 
For example, m uteins of TIF, which bind to the TIF receptor but do not activate it, therby 
inhibiting IL-9 induced activity, are a feature of the invention. Examples of conditions which 
can be treated by the use of such TIF muteins are allergies, asthma, and so forth. Muteins in 
accordance with the invention can be made in accordance with, e.g., Weigel, et al, Eur. J. 
Biochem 180(2):295-300(1989) and Epps, et al, Cytokine 9(3): 149-156(1997), both of which 
are incorporated by reference. Such muteins can be used in the treatment of asthma, allergies, 
or both. Further, it will be clear to the skilled artisan that the models set forth, supra , can also 
be used to screen for appropriate muteins/ The ability to regulate IL-9 activity is important 
in conditions such as those listed supra , as well as conditions such as apoptosis, including 
Cortisol induced apoptosis, conditions involving the nuclear expression of BCL-3, since IL-9 
is known to induce such expression, and so forth. "Antibodies, " as used herein, refers to any 
portion of an antibody which binds to TIF, including chimeric and humanized antibodies. 

Another feature of the invention relates to the ability of the TIF type molecules of the 
invention to either promote regeneration or inhibit differentiation of tissue types on which the 
molecules are active. As was shown, supra , the TIF molecules target various cancer and 
normal cell lines (i.e. , mesangial and neuronal cells, as well as melanoma and hepatoma cells). 
Hence, one can stimulate regeneration of tissue via, e.g., adding an amount of a TIF type 
molecule to a sample in need of regeneration of a tissue acted on by the TIF molecule. This 
approach can be used both in vitro, and in vivo. Similarly, antagonists of TIF may be added 
when the situation is one where the aim is to inhibit differentiation of a particular type of 
tissue, such as melanoma or hepatoma. 

The genes which encode TIF, as noted in Example 25, supra , are located on 
chromosome 12. This chromosome is associated with asthma, as is known in the art. Hence, 
a further embodiment of the invention is a method for determining susceptibility to conditions 
such as, or related to asthma, by determining if aberrations, such as polymorphisms, deletions, 
additions, etc. , are present at the site of the TIF gene. Such aberrations may be an indicia of 
susceptibility to, or of the presence of, asthma, an allergic condition, or one or more related 
conditions. The ability to detect aberrations in a DNA sequence is well known in the art, and 

22 



WO 00/24758 



PCT/US99/24424 



such methods need not be set forth herein. Preferably, the aberration or aberrations is detected 
via standard techniques, such as PCR, using the methodologies and primers referred to supra . 
Other features of the invention will be clear to the artisan and need not be discussed 

ftuther. 

5 The terms and expressions which have been employed are used as terms of description 

and not of limitation, and there is no intention in the use of such terms and expressions of 
excluding any equivalents of the features shown and described or portions thereof, it being 
recognized that various modifications are possible within the scope of the invention. 
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Claims 

1 . An isolated nucleic acid molecule which encodes a T cell derived inducible factor, the 
complementary sequence of which hybridizes, under stringent conditions, to at least 
one of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, SEQ ID 
NO: 24 or SEQ ID NO:25. 

2. The isolated nucleic acid molecule of claim 1, wherein said isolated nucleic acid 
molecule encodes a protein having the amino acid sequence of the protein encoded by 
SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, SEQ ID NO: 24 or 
SEQ ID NO: 25. 

3. The isolated nucleic acid molecule of claim 1, wherein said molecule is cDNA. 

4 . The isolated nucleic acid molecule of claim 1 , wherein said molecule is genomic DN A . 

5. The isolated nucleic acid molecule of claim 2, the nucleotide sequence of which 
consists of the nucleotide sequence SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, 
SEQ ID NO: 29, SEQ ID NO: 24 or SEQ ID NO:25. 

6. The isolated nucleic acid molecule of claim 4, having the nucleotide sequence of SEQ 
ID NO: 25. 

7. An isolated nucleic acid molecule which encodes the protein encoded by the isolated 
nucleic acid molecule of claim 1. 



8. Expression vector comprising the isolated nucleic acid molecule of claim 1 , operably 
linked to a promoter. 
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9. Expression vector comprising the isolated nucleic acid molecule of claim 2, operably 
linked to a promoter. 

10. Expression vector comprising the isolated nucleic acid molecule of claim 3, operably 
linked to a promoter. 

1 1 . Expression vector comprising the isolated nucleic acid molecule of claim 4, operably 
linked to a promoter. 

12. Expression vector comprising the isolated nucleic acid molecule of claim 5, operably 
linked to a promoter. 

13. Expression vector comprising the isolated nucleic acid molecule of claim 6, operably 
linked to a promoter. 

14. Recombinant cell comprising the isolated nucleic acid molecule of claim 1. 

15. Recombinant cell comprising the isolated nucleic acid molecule of claim 2. 

16. Recombinant cell comprising the expression vector of claim 8. 

17. Recombinant cell comprising the expression vector of claim 9. 

18. Recombinant cell comprising the expression vector of claim 10. 

19. Recombinant cell comprising the expression vector of claim 11. 

20. Isolated protein encoded by the isolated nucleic acid molecule of claim 1, and having 
a molecular weight of about 17-30 kilodaltons as determined by SDS - PAGE. 
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21. The isolated protein of claim 20, comprising at least 120 amino acids of the protein 
encoded by SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, SEQ ID 
NO: 24, or SEQ ID NO: 25. 

22. The isolated protein of claim 21 , comprising at least all but the 40 N terminal amino 
acids encoded by SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, 
SEQ ID NO: 24, or SEQ ID NO: 25. 

23. The isolated protein of claim 22, comprising at least all but the 20 N terminal amino 
acids encoded by SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO:29, 
SEQ ID NO: 24, or SEQ ID NO: 25. 

24. Antibody which specifically binds to the isolated protein of claim 20. 

25. The antibody of claim 24, wherein said antibody is a monoclonal antibody. 

26. A method for determining effectiveness of interleukin-9 on a cell, comprising 
contacting said cell with an agent specific for at least one of (i) an isolated nucleic acid 
molecule which encodes a protein whose amino acid sequence is identical to the amino 
acid sequence encoded by the nucleotide sequence of SEQ ID NO: 7, SEQ ID NO: 8, 
SEQ ID NO: 9, SEQ ID NO:29, SEQ ID NO: 24 or SEQ ID NO: 25 and (ii) a protein 
whose amino acid sequence is identical to the amino acid sequence encoded by the 
nucleotide sequence of SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID 
NO:29,SEQ ID NO: 24 or SEQ ID NO: 25 and determining interaction of said agent 
with (i) or (ii) as a determination of effectiveness of interleukin-9 on said cell. 

27. The method of claim 26, wherein said agent is an antibody which specifically binds to 
(ii). 
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28. The method of claim 26, wherein said agent comprises the isolated nucleic acid 
molecule of SEQ ID NO: 7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 29, SEQ ID 
NO:24, or SEQ ID NO: 25. 

29. A method for stimulating activation of a STAT protein comprising administering an 
amount of the protein of claim 20 sufficient to stimulate activation of said STAT 
protein. 

30. The method of claim 29, wherein said STAT protein is STAT1 , STAT3 or STAT 5. 

31 . A method of inhibiting activation of a STAT protein, comprising administering an 
amount of an antagonist of the protein of claim 20 sufficient to inhibit stimulation of 
expression of said STAT protein by said protein. 

32. The method of claim 3 1 , wherein said STAT protein is STAT1 , STAT3 or STAT 5. 

33. A method for determining presence of TIF in a sample, comprising contacting said 
sample with an agent which binds to TIF or a nucleic acid molecule encoding TIF, and 
determining said binding as a determination of TIF in said sample. 

34. The method of claim 33, wherein said agent is an antibody. 

35. The method of claim 33, wherein said agent is a nucleic acid molecule. 

36 . A method for screening to determine if a substance influences IL-9 activity , comprising 
adding said substance to a sample of TIF producing cells, in the presence of IL-9, and 
determining production of TIF, wherein a difference in production of TIF by said cells 
as compared to production of TIF by said cells in presence of IL-9 but not said 
substance indicates said substance influences IL-9 activity. 
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37. The method of claim 36, wherein said substance is an IL-9 inhibitor or antagonist, said 
method further comprising determining lower levels of TIF production by said cells in 
the presence of said substance as compared to its absence. 

38. The method of claim 36, wherein said substance is an IL-9 activator, said method 
further comprising determining higher levels of TIF production by said cells in the 
presence of said substance as compared to its absence. 

39. A method for determining an aberrant level of IL-9 activity in a subject, comprising 
determining level of TIF in a subject and comparing said level to a normal level, 
differences therebetween being indicative of an aberrant level of IL-9 in said subject. 

40. The method of claim 39, wherein said aberrant level is excess endogenous IL-9. 

41 . The method of claim 39, wherein said aberrant level is insufficient endogenous IL-9. 

42. The method of claim 40, wherein said subject suffers from asthma, an allergy, or 
lymphoma. 

43. A method for inhibiting IL-9 induced activity in a subject in need thereof, comprising 
administering an amount of a TIF inhibitor sufficient to inhibit IL-9 induced activity. 

44. The method of claim 43, wherein said TIF inhibitor is an antisense molecule. 

45. The method of claim 43, wherein said inhibitor is an antibody. 

46. A method for treating a subject suffering from asthma or an allergy, comprising 
administering to said subject an amount of a TIF mutein sufficient to alleviate said 
asthma or allergy. 

47. A method for determining if a mutein of TIF is therapeutically useful, comprising 
contacting a cell which produces IL-9 with said mutein, and determining effect of said 
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mutein on prouction of IL-9, reduction thereof being indicative of possible thereapeutic 
efficacy for said mutein. 

48. A method for determining susceptibility to a condition characterized by aberrant 
expression of TIF, comprising determining nucleotide sequence of a TIF gene of a 
subject believed to possess an aberrant TIF gene, presence of an aberrant TIF gene 
being indicative of possible susceptibility to asthma or allergy. 

49. The method of claim 48, comprising contacting a sample taken from said subject with 
a pair of oligonucleotide primers which amplify said TIF gene. 
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TIFalpha cDNA 



taaacaggctctcctctcacttatcaactgttgacacttgtgcgatctctgATGGCTGTCCTGCAGAMTCTATGAGTTT 

TTCCCTTATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATTGCCCTGTGGGCCCAGGAGGCAAA 

TGCGCTGCCCGTCA 

ACACCCGGTGCMGCTTGAGGTGTCCMCTTCCAGCAGCCGTACATCGTCAACCGCACCTTTATGCT 
GGCCAAGGAGGCC 

AGCCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGAGAAACTGTTCCGAGGAGTCAGTGCT 
AAAGATCAGTGCTA 

CCTGATGAAGCAGGTGCTCAACTTCACCCTGGMGACGTTCTGCTCCCCCAGTCAGACAGGTTCCA 
GCC CTACATG C AGG 

AGGTGGTACCTTTCCTGACCAAACTCAGCAATCAGCTCAGCTCCTGTCACATCAGCGGTGACGACCA 
GAACATCCAGAAG 

AATGTCAGAAGGCTGAAGGAGACAGTGAAAAAGCTTGGAGAGAGTGGAGAGATCAAGGCGATTGG 
GGAACTGGACCTGCT 

GTTTATGTCTCTGAGAMTGCTTGCGTCTGAgcgagaagaagctagaaaacgaagaactgctccttcctgccttctaaaa 

agaacaataagatccctgaatggacttttttactaaaggaaagtgagaagctaacgtccatcatcattagaagatttcac 

atgaaacctggctcagttgaaaaagaaaatagtgtcaagttgtccatgagaccagaggtagacttgataaccacaaagat 

tcattgacaatattttattgtcactgatgatacaacagaaaaataatgtactttaaaaaattgtttgaaaggaggttacc 

tctcattcctttagaaaaaaagcttatgtaacttcatttccatatccaatattttatatatgtaagtttatttattataa 

gtatacattttatttatgtcagtttattaatatggatttamatagaaacattatctgctattgatatttagtataagg 

caaataatatttatgacaataactatggaaacaagatatcttaggctttaataaacacatggatatcataaaaaaaaaa 



MAVLQKSMSFSLMGTLAASCLLLIALWAQEANALPVNTRCKLEVSNFQQPYIVNRTFMLAKEASLADNNT 
DVRLIGEKLF 

RGVSAKDQCYLMKQVLNFTLEDVLLPQSDRFQPYMQEWPFLTKLSNQLSSCHISGDDQNIQKNVRRLK 

ETVKKLGESGE 

IKAIGELDLLFMSLRNACV 



aacagactctcctctcagttatcaacmtgacacttgtgcgatcggtgATGGCTGTCCTGCAGAAATCTATGAGTTTT^ 

CCCTTATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATTGCCCTGTGGGCCCAGGAGGCAAATG 

CGCTGCCCATCAAC 

ACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAGCCGTACATCGTCAACCGCACCTTTATGCTG 
GCCAAGGAGGCCAG 

CCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGAGAAACTGTTCCGAGGAGTCAGTGCTAA 
GGATCAGTGCTACC 

TGATGAAGCAGGTGCTCAACTTCACCCTGGAAGACATTCTGCTCCCCCAGTCAGACAGGTTCCGGC 
CCTACATGCAGGAG 

GTGGTGCCTTTCCTGACCAAACTCAGCAATCAGCTCAGCTCCTGTCACATCAGTGGTGACGACCAGA 
ACATCCAGAAGAA ^ 

TGTCAGAAGGCTGAAGGAGACAGTGAAAAAGCTTGGAGAGAGCGGAGAGATCAAAGCGATCGGGG 
AACTG GACCTGCTGT 

TTATGTCTCTGAGAAATGCTTGCGTCTGAgcgagaagaagctagaaaacgaagaactgctccttcctgccttclaaaaag 
aacaataagatccctgaatggacttttttactaaaggaaagtgagaagctaacgtccaccatcattagaagatttcacat 
gaaacctggctcagttgaaagagaaaatagtgtcaagttgtccatgagaccagaggtagacttgataaccacaaagattc 
attgacaatatutattgtcattgataatgcaacagaaaaagtatgtactttaaaaaattgtttgaaaggaggttacctc 




TIFbeta cDNA 
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tcattcctctagaagaaaagcctatgtaacttcamccataaccaatactttatatatgtaagtttatttatlataagt 

atacattttatttatgtcagmattaatatggatttatttatagaaaaattatctgatgttgatalttgagtalaaagc 

aaataatatttatgataataactatagaaacaagatatcttaggctttaataaacacatgaatatcataaa 



TIFbeta Protein 



MAVLQKSMSFSLMGTLAASCLLLIALWAQEANALPINTRCKLEVSNFQQPYIVNRTFMLAKEASLADNNTD 
VRLIGEKLF 

RGVSAKDQCYLMKQVLNFTLEDILLPQSDRFRPYMQEWPFLTKLSNQLSSCHISGDDQNIQKNVRRLKE 

TVKKLGESGE 
IKAIGELDLLFMSLRNACV 



TIFalpha Genomic Sequence 

GTCTATCACCTGCTTAAGATTCTTCTAATTTATAAAAAAAACTATTTCTTAAMTGAAM 
GCACGTATTTA 

TAGCATGGTGTTCTGACCATGCAGGTACAGAGTGGAATGGTAAGAGGCGCTATTATCAGCATTAACC 
mC^CT^CMGCAAACTTGAAAT 

ACTGGAGTCCG 

CTGCTGTCCAACAGAGCTCTTGAGCACGCTCTCCTCTGTTTGCAATTTTATGTTCmGATCGACTCC 
GCAAGGTCTGAG 

CTTCGGCTCCTGATGGCCACCTTTCAACTTTCTGCATTTATGMCTCCATGTTTTMTCTTTTTATTAA 
AATCAGTGmGTGCAAGTCTGTWCACCCACATGTATGTCTGTGCACCAAGTGCTGCCTGGTGCTT 
GCAGGAGAGGGTGCCCTGGCACCGGAGTCACGGATGGTTGTGAGCCACCATGAGGATGCTGGGAG 
CCAG^GTGCAGC^TGCTCTTAACCACACGC 

T^AATA^^^CTGATGGCTTCACTCACTGCCACCTCCCCTTTGCATCTTTCTGCCAAGGAACACC 
C^C^CACTGCmCGCTCCTCAAGTCTGCACCTCTCAACAGGTCAAGATTCTCCAGTGTCCCTCTA 
TGTCCCTCTAACACTTTCTCCAGTGTCCCTCTMCACrrrCTCCAGTGTCCCTCTAACACTTTTGATCT 

ggg^aga^g\tctcacacagtgattttcatgacttcgcgttctagtctagatgtaggcatttgcgt 

AGGCGTCTGCTCCCGCTGCTTAGGAA^^ 
GCAAAA^ATCC^AGTATTTTGTAAATTCTCTTCTTCMCTATCCA 

TCATTTAAAAA ' 
TAATATTTTGAGACTTATGCTTGC/^AAGTAAMTGTCAGAGMTTAGCAM 

ttttaaAAAAA 

TCTATGCTTAAAATGTCTATTAGATTGTTCACTACCGATATTTCCAMCTTAACTTGACCTTGGCTATG 

tgta™catctaccatmcagtctctgmccagmcattctgtggcaatgggagc 

TTAAAAA/JwSaA^ 
TAAAAATTGA 
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AATAATCTCCAAGTmCATTATGGCm 
GGTCACTTCTA 

AAGAGATAAGAATCTATGAAGTTGGTGGGAAAATGAGTCCGTGACCAAAACGCTGACTCAATAGCTA 
CGGGAGATCAAAG 

GCTGCTCTACTCAATCAGMTCTACTACGGCAAAGCCATGGCTTTCTTTGAAAACCGTGTTTAGAAGA 
TTTCTGGGATTT 

GTGTGCAAAAGCACCTTGTTGGCCCTCACCGTGACGTTTTAGGGAAGACTTCCCATCTCTCAAGGTG 
GGAAGGCTTGGAG 

GTGGTGTCTTGTGGCCTCCTATGGTGGTTAGGTACTTCTCAGAAGACAGGACTGGAAATTAGATAAT 
GTCTGATGTCATA 

TCATTCACAATACCAAAAAAACCCTGGTGTCCCGATGGCTATAAAAGCAGCAACTTCTGCCTCTCCC 
ATCACAAGCAGAG 

ACACCTAAACAGGTAAGCACTCAGACCTCTACAGACAATCATCTGCTTGGTACCATGCTACCCGACG 
AACATGCTCCCCT 

GATGTTTTTGCCTTTTGCTCTCTCACTMCAGGCTCTCCTCTCACTTATCMCTGTTGACACTTGTGC 
GATCTCTGATGG 

CTGTCCTGCAGAAATCTATGAGTTTTTCCCTTATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATT 
GCCCTGTGGGCC 

CAGGAGGCAAATGCGCTGCCCGTCAACACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAGCC 
GTACATCGTCAACCG 

CACCTTTATGCTGGCCMGGAGGTACAGCTGCATCTCTTTCTCTCCATACCGCCTTGCCATTTTCTCT 
G AAG C ACTTG C A 

AACTCTTTAGGGGCGCTTTATCTCCGCAGGTCTCACTACCTATGTTTTCTGTCTCTTTAGAGACTCTT 
TAAGGACTGGGT 

CTTTTrCTATTTCTATTTCMGGTCTCAGGACCATTTCCTATCTTGGCCTTCAGGACACATATA 
TTTTATCTACA 

GAGGCGCATTTAGAMGCCACCCACGACTGCMTACTTTCCATTTCTCTGTGCTCTCTTCTGAACTCA 
TACTCTCTTGGC 

TACfCCTGAGACCCACTGCGGACATACATCTCTACTTACAGGCTTTTCTTCCATCTCCTTGTCACCCA 
GGCACTTAGGGT 

TTTCTCTCTTTCAGGCCAGCCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGAGAAACTGTT 
CCGAGGAGTCAGT 

GTAAGTCCTCACTGTGATGAGCAGGGCTAGCTGCGGGAGCTGGTGGACCCTCTGGGATAGTCTGAC 
GTATGACCCCTGCT 

GCTTCTTGTCTACCTGCAGGCTAAAGATCAGTGCTACCTGATGAAGCAGGTGCTCAACTTCACCCTG 
GAAGACGTTCTGC 

TCCCCCAGTCAGACAGGTTCCAGCCCTACATGCAGGAGGTGGTACCTTTCCTGACCAAACTCAGCA 
ATCAGCTCAGCTCC 

TGTGTAAGTCTGACTCTGGCTACCTATGCTCCTCTCTCTTCCTCTTCTATTCCAGTAAGAACCCGAGG 
TCCTGCCCTCTC 

TCTCTTCACAAGAGTGAGGAGGGCCTCAGCACCACCACCATCATAGGCCACTTGAAATAGGTCACAA 
AGGCTTTGGCTTC 

AATTGAGTAATACTTTGAGTTTGTATGAGTGMGCTTTATTTGTTTTATC 
AAATTCTGTAG 

GATGAGAAAGATGTTGGGAACGAAAAAAGGCCTAGATAGAGAAACAGATCTGCTGAGTATAGTACTT 
ATGGGGGGAGCAG 

GGGGCGATATCCACTGAGTACMGTACTTGTGGGGAGAGAAATCCACTGAGTACAAGTACTTGTTG 
GCATGGAGATCCAC 

TGAGTACAAGTACTTGTGGGGGGAGGGAATGGCACAGAGCAAAAGTTGAAGGGAAGGAAGATGGA 
GAGGCCTCATGGTTG 

GGGGTGTGAAAGGTCACTCCTTTTCCATGTGATGGAGAGTTAAGAAAAACCAGTGTGTGAGTTTGAT 
GTCTTCAGACACC 

CCCAA CTATG AAACATATCCACGAGGAGCGGGCAGACTGTGGGAGACCTGGCATTTAGGGAAGGC 
GCGGCTTTTCACACG 
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AGAAACTTTATGCTCATCTCTTGTGCTACACTCCCACCTTTGATGAGGTTCAGCTCAGGTTTCGTTTC. 
TACCGTTCTTGC 

TACTGGTGGAAACTTCAGTAGGATTCCCCAAAGACGAGGACAGCTCTTCTGTAAGGGAGGGACCTG 
GATTTCAGTGTCCT 

AGAGAACGAAATAGCTCAGAGAATCTAGGTCAACGTGAAATCTAGGTCACAGCGGGCAAAAATGACT 
GAACGCCTCTATT 

CCAGGTGAACGGTCACGTGCCTCAGATATACTGAGGTATTGGGCTCCCACCGGATAAGATTCTGTTA 
GTGAGTCTGCTTT 

TATTTTGCAGCACATCAGCGGTGACGACCAGAACATCCAGAAGAATGTCAGAAGGCTGAAGGAGAC 
AGTGAAAAAGGTAC 

TATTGGCAAGCCACMTACTMGCCATTCAGTAGGAGACGTGGGGATTTCTTTCTCTGCTTCCCAGT 
CCCTTCTACTTTG 

TMCATTTTATTTGACTTGTCTACTATCTGGTCCATTACTCGCTTAGCTGCACCTGTATCTAGCTGGGT 
CTATAGATCTT 

TCAATCTGTGTCTAAATTTGTAAGTCACAATTCTGGAGCTAGCAGAAAGCTTAGCTCAGCCAGTCTCA 
TGAGCACTTGCT 

CGGAGGATGGCTTGTGACAGAGTCAATGCTAGAAGACAGCATCCCTGATTCCCAGCTCTGCACTTG 
CCTAGTGGCCATGT 

GTMTTACTTTGGCTTGATTMGTATTTGGGAAAGCCAGTTCCCACGGACCTACATAATCTGAAGAAC 
CATGCATTGAAA 

ACTAG AAAGCTGGGCACAAACTTACTAG AG ATGA I I 1 1 1 GAGCTCATTAAACGGATGCTCTGAAATGT 
GGCAAAATCAAC 

CCAGAATMCMCAAMGAGCTGGAmGCAMTAGGACMGTATTTAGAATCACTGGTATTAATAGC 
TATCATCTTAAT 

TAAAATATAGGGCCTATATATATATTTMGATTAMCACMGAGTGGATAGCCTCCCAATTTACTTGG 
CCTGGTTTCAAA 

AGAGTAAAAATATCAGTCATGGATTMTTATAGTGTCATGAAAGTATGAGATGGAAACCCTTTCCTTA 
CTTTTTACCTTC 

ATTTCTTAG I I I I I I I II I CTTCACACCCTGATCAAGCCACTAGTAAGCACCTATCTGCTGTGAGCTAT • 
TATATGACTTT 

ACAGCAAACAACATTGCTGTGTGGCCTCTTTGGGGAAGGGAACAGGATAGCAGGAGGCTCAGGCTA 
G C AAGTCTG ACTTG 

CCCTAAAGCCAGAGGCATGGTTGATAGCAGAGAAAGTGAGGCTCTTCGCAAGTGGGTGTGCTTAAG 
TAATCAGAAACAGG 

AAGGCTCCGGTTGATGGAATTATCAGTAAGATATCTACCCTTATCTCCTTCTATCGAACCTAAATCGT 

CTCI ! I I ICTTG 

TGTGTAGGCTGATAAACACACTTGTTTTCTTTT 

CAGTTCTTGTT 

AGAGGGTTTGTTACCTTGACACCTGGGCTTGGATGTTAGCATGCCAAAGGCACACACTTCTGAATGC 
CTGTGTAAAAGGT 

XATTATTCATTTACTTTGTCTTTGGAMGGTGMGCGTGTGTGAGAAAGAACTCACAGGAGATGTGTT 
CTCTGTAGGAAA 

AC TTTTTTn I CCCCTTAAATGCCTATMTCCACTTTCAGTCAACTTTGACTTTTATACCATGCTGTCAC 
ATGAAAGAGT 

GTTTAGGCCCGCTCTCATGGCTCTGGGAAAAGCACCAATAGGGGAAGGAATGTTATGCTGAGAAtAT 
CTGACCGGCAGGGA 

AACTGGTCAGAGCTCCCCCGAAGACCACCACAGGTGTTAAGTAGGAACAGTCCAGGGTGGGCTCAT 
GTAATAGAATGGAA 

CAGAGCGAGGGAAGATAAGCTACAAAGTTTCATAGGGTCCGGAGTCTTAAAGATACAAAATAGCTGC 
TTGGGCTTCATAA 

CAAAGGAAGTCTGGGAAGGCAGCAAGTGAGAGGGAAATGGAAAGGGAAAAAACAGAATGTAGAGG 
ACTTGAACAGCTACA 

AATCCTCTACCAGACGATTTTTCTTGGAACAATCTAGMGGTAGTGGATTAGGTGATTGCAGGGGGA 
CTTGCTTTGCCAT 
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TTGMTCTGGGTTTTTGTCTCTCCATTGAG 
GAAAGAAGGGGT 

GTTATGACTCCTACCTGGAGTTTTACTAGTTTACGCAATGGAACAGACACTCGGGACCTCCTCTTGA 
CAAAAAAAATGGA 

AACCTGTTGTirGTCTTGmGTTCTTTTGTTMGAMGCACAGGCAAAGCCCGACCACATGGG 
ATGTGGGTCTTT 

GAGTCAAGGCTTTTGAGTTGAGCACTCATCAATAGTTGATCATGGTCAGGTGGAGGGCTACCTGTCA 
GGCCGAGCCCTGC 

T.GGCTTCGCACTTAACATCTCCAGGTCTCAGTATCACTTCCTGCTACTTAGCACAGTTAGGAGTTGA 
GCAAACCTTTTTT 

TCCA^CCCCCACTAAMTTTMTTGACAAMGACTGTGTAATTTGTGGGATACAGTGTGATAATTGAT 
CTATGTGTG CAT 

TGTGCAAGGTTCMTMGATAGATTAATAGGCCCATCMCAGCTTTATGGGTGTGAAATGCAAGTAAT 
ATAGGTAGATGC 

CTGTGGTGTCCTTAGGTCAGAAAGGCATGATTTTMGGTCTTGGGCAAATCATATTATACTCATGCTA 
AAAATACATTAT 

GTTGATTATTAATCTTTTAGAGMGGCTGATACTTGGTTTTGGTGCTCAGCAAGCAAATGTCACCAGC 
TC7TTCTAACTG 

GTACCACTTTAGAAMTGCTACCTGTGCTCAAATTGGTTTGTATTCTTATTTTCATAGCTTGGAGAGA 
GTGGAGAGATCA 

AGGCGATTGGGGAACTGGACCTGCTGTTTATGTCTCTGAGAAATGCTTGCGTCTGAGCGAGAAGAA 
G CTAG AAAAC G AAG 

MCTGCTCCTTCCTGCCTTCTAAAAAGAACMTM^ 
TG AG AAG CTAAC 

GTCCATCATCATTAGMGATTTCACATGAAACCTGGCTCAGTTGAAAAAGAAAATAGTGTCAAGTTGT 
CCATGAGACCAG • 

AGGTAGACTTGATMCCACAMGATTCATTGACMTATTTTATTGTCACTGATGATACAACAGAAAAAT 
AATGTACTTTA 

AAAAATTGTTTGAMGGAGGTTACCTCTCATTCCmAGAAAAAAAGCTTATGTAACT^ 
TCCAATATTTT 

ATATATGTAAGTTTATTTATTATAAGTATACATTTTATTTATGTCAGTTTATTAATATG 
GAAACATTA 

TCTGCTATTGATATTTAGTATAAGGCAAATAATATTTATGACAATAACTATGGAAACAAGATATCTTAG 
G C | | I AATAAA 

CACATGGATATCATAMTCTTCTGTCTTGTMTTTTTCTCCCTTTAATATCAACAATACCATCATCATCA 
TCATTACCCA 

ATCATTCTCATGATTTCATGCTTGACCCATATTATACTGTTAAAGTTGGTTCCTGGAGGCCTGTGGTT 
TTGTGTGTGTTG 

TGTGTGTGTGTGGGGTTATGCATGTGAAAGCCAGAGATGGATATTAGGTGTTCTTCTCTATCAGTCTT 
TG C CTT ATTATT 

TGAGACAGGGTCTGTCACTGAACCTGTAGCTAGGCTGGCCAACAAGCTCTATTAAI I 1 I I I 1 I AAG AT 

TAATTAATTATG 

TGTAT 



exon 1a: nucleotides 1908-1932 
exoh 1b: nucleotides 2033-2262 
exon 2: 2655-2720 
exon 3: 2820-2963 
exon 4: 3931-3996 
exon 5: 6539-7137 
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Human TIF cDNA 



TGCACAAGCAGAATCTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTG 

TCTGCAATGGCCGCCCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTG 

CCTCCTTCTCTTGGCCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGC 

TTGACAAGTCCAACTTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCT 

AGCTTGGCTGATAACAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTAT 

GAGTGAGCGCTGCTATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTC 

AATCTGATAGGTTCCAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGG 

CTAAGCACATGTCATATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGA 

CACAGTGAAAAAGCTTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTA 

TGTCTCTCAGAAATGCCTGCATTTGACCAGAGCAAAGCTGAAA 

CCTGCTAGAAATAACAATTAGATGCCCCAAAGCGATTTTT 
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Human genomic sequence of TIF 
SHZ? ID a)0\ i<sr 

TGCAC^GCAG^TCTTC^^^CAGGTTCTCCTTCCCCAGTCACC^GTTGCTCGAGTTAGAATTGTCTGCAATGGCCGCCCTGCAGA 
AATCTGTGAGC7CTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCT7CTCTTGGCCCTCTTGGTACAGGGAGGAGCAGCTGCGC 
CCATCAGCTCCCACTGCAGGCTTGACAAGTCCAACTTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGgtat 
acatcccaatcccgccctcccccgctggatctactcggairccaaacagtccccaaacttttcttcagagcacctctaagagcttta 
ggaacccaccgcttacccccgagggtagataaattttctcrtttcccagagaccctttgggaacctggctttttttctttcttgaac 
ttcttccttccactttggcctttacgatacatatgacgaatctttcccaaagagcggccattcagtaatccatccgatgattttttt 
CtccCttatgcctctgcccatcgttctaaactcatgcacacatccgaBttccgctitttagtctttacgacgctgctctggggagacg 
ggacggggcacatgcccarcracaaattttttttctatttcctca&tgtccagacccttagccttttcttcccttccagGCTAGCTT 
CGCTGATAACAACACAGACC-TTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTg t aagc t acag c t g t gacgaacagggccg 

tgtgccgtccatgggcacctcgggcggtggtgatgacggrttaggzcctatcccttatgaccctctccgtttcccttccacccgcag 
ATG AGTG AGCG CTGCT ATCTG ATGAAGC AGGTGCTG AAC77CACCC T IGAAGAAGTGCTGTTCCCTCAATCTG AT AGGTTCCAGCCT 
TATATGCAGGA.GGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTgtaagttcagctcCcagcctatgcccacct 
acccctccttccccccctccacagagacccccccaccccaactctc-cwccccccccctacccccaagccagcaggaagaagtgtct 
tggcagcagcg;caccaaga=;cacctgggatcatagagiaccccct^tcgctttgactgagccacaccttgagcctatagcggtga 
atggggtctggaacttaagtgiacagaagccgcattggttcgcctccggaaaaaaggcaactcaggctgcgtaagatgagaaaggtg 
tcgggaaaacac^tagctgtgaaaacggatccattgagtciaag^czcgagcggaggggatggcatggagagaaactagaagagaa 
agtgggaaacgggaaggctcaaagtcggtggcgggtcggcagactcz;gccccgtcgatgccatgggaagcca=aaaatcggaggcg 
tgtgaactcgatgccgcccaacactcgaaactacgaaaaaaagcccgagtggagcgggcccagtaaaaggccctaggacttactgaa 
gagggctcaattttcacarcagatgctttatgcacatcccrtgccciaagcatgcaatcccctggagacacgattgaggttttattc 
cttacagaacctgcataaar^accccgctcttcccacaaa:gcaaa:c:cagcaggattccccaaagacgaagagaggtctcttgta 
agggaagcgaccggaccc:cgcgcccaagggaactcaaga=ccca=gaaatctaggccactgttgaaatctaggtcattgtgggcaa 
aattactaagagccccaattccacgcgaactgtaccgtacctcca-grg-gtggaggttcataaagcttcagcacaacattaagata 
gt tatgctt c; cat tgi c c c ar a gC ATATTG AAGGTG ATG ACCTG CAT ATCC AG AGG AATGTGC AAAAGCTG AAGG ACACAGTGAAA 
AAGgcaggactgacaacccicaatgctaagtcatgcaataggagaracaaaccccgttttCctttccctcctcccttcccaccactt 
cgcgattccccacccgarrcicctaccaccagggcgattacctccgrcrctgtccatgtagacatatctatatacctagatgccagc 
ttccaaatctcgcaaaccg-agaattctagaaccggttggcaccccagcccgcctagtcacacaacctcagactctggggatggtca 
gtggcagagacagggccaraargcaggtctcctgaatcccaagccaciarttctcccggtggtgacacagaccagctttggcaccat 
taacccttagggaaatcicaratccccattgacccacgtaatcccaagaagtacctgtttaaaaacagaaaaacgcccatgggcaaa 
ttcatttgaagtcatttcrgaagtcatcaatgcactgcttcgaaaitcggaagaacaaactcagaacaatgagaaaagagctggact 
tgcacacagggctaaccccicgagtaacaaacacttatcttgaadarcataacatccatcagatattgactacagcttaaaagcaa 
gagcagacaaccccgacc:c:cctatacaggttcaaatagagtaaaaatattagtaagagatttattatagttaaatggaagtctga 
attggcaagccccttcccc;tcctcccccccatcaagaccicccatcc:agtcccttccttcactccctcaacaaatccctagggag 
cacttatccatggcgggcrcgrgtacatccctatagtgaatgacaccatcatgtggcctatctggtgaaaagaacaacaatggaagg 
cttagactaacaacagtgac-caccccaaaaccggaggaatgatcaggagcagtgaaagtgacgctcttgcaagcaggcacaactaa 
atactcagaaacacgaaggciccagccgacggaatcttcagwaacaagcttaaccctaattcccccttcttcccccccgacttttta 
aaaaagcgttccctcctcagratcactcaatgagcgtgaccctct:c;ccccttgacaaccgaaggctttgtagttccaaattgtgaa 
gcccagcccccttgttaragaaccactatctagacatggagggcccaatgctagcacgccacagacaaggcatgctttacacatctt 
gcctaaaaaaccactgatctcacctcgctcgtcgcctttacaaaagccaagtgcgagagaggagaaccccatggcgatctgtgtgat 
tttcaagaccttcaacccaicccgaaagaatcaatttcatatctgcaatgggttgccatgtggaagagcgaccacgcttttttgctg 
gtagcttcagaaagcacaggagggagagcaatgecgttcagagaaagarcaacaggaggagaaactgtcagagctgtctgaaatagg 
gtggtcctgggaggcattaattcccccccgtcgggggtaaaagcagaacgcaggttggtagtaaaatgcatgacagacagtagggga 
cgataaacttcaaaactccttatagtcttggagtccttgagacagaaaagaatacctttttggccttatgtcaaaagaagtatggaa 
aggtgaaagggcggaagaaagcaggaaaaggaagaaccatgtatcaratagaggacaacggtgacaaggtttttcttgaaataatgc 
aaatatgatagatcagagcaatcccagcagggaatgcttttcactcgaatttgggtttcctcttcgattaagtccgggatcctcatc 
tgcatttgacttggagagagaaagaatgaatgtcaggacccacacccggcttcccattaactaaagcaagtggaaaagacttatttg 
gtacttttcccacaaaagtgaaaactttcctctcactgtccgtcaaaaaggtggaaatagaaaaagccttaacgtattggtgaatac 
atggttcaaagtcatttcagtagagacgcctcaaatcaggagcgtccaaccatctggcctccctggaccaccctgaaagaattgtct 
tggcacacacacaaaacacaagaacaacagctgatgagccaaaaaagcccacgcacaaatctcatactgttttaagaaagtttacga 
attcctgctagggtgcatccaaagctgtcccgggccatgtgcggcctgrgggccgcaggctggacaagctcctcataagtaatctgt 
catagatagtcttggagccgcaaaacaggc^aaggcataargggtggcactcgggatcccccagatcccagcctcacttcagtctcc 
tcgctctggtcaagaaggggtggtcaactccccgcccagcttctaaacagcttcattagtgtgaggtgcacccgaaactgatgcctg 
ctggtggcctctcagtccacagagccgtcatttcaagctcrttggcaaaccatacaatactaaagggatattactatgaatgttcta 
caaatgcccaaaactcggtCwCtgwCtccatcaacctaatcttgcaatCCctaatttgttcacCtcagaaaacatggcataaatgct 
caaat ac 1 1 C tgcat C CC C at t C t c ac agCTTGG AG AG AG7GG AGAG AT CAAAGC AATTGGAG AACTGG ATTIGCTGTTT ATGTCTC 
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TIFbeta Genomic Sequence C \T) /JQ ) ~}/J 

GAATTCAAGTCCACATGCMTCAATCCGAATACTTTGTAMTTCTCTTCTTCAAATATCCATCTATATA 
GTATMGTTATTGTAGGATCATTTAAAAATAATGTm 

AGAGAATTAGCAAATGTATAGTATTATTTTATTTTAAAAAATCTATGCTTAAMTGTCT^ 
C ACTACTG ACATTTCCAAACTTAACTTG ACCTTG GCTATG ATTTC 

AACTGTGTGCTCACTTACCATGCTATCCGACGAGCATGTTCCCCTGATGTTTTTGCCTTTTGCTCTCT 

CGCTAACAGGCTCTCCTCTCAGTTATCAACTTTTGACACTTGTGCGATCGGTGATGGCTGTCCTGCA 

GAAATCTATGAGTTTTTCCCTTATGGGGACTTTGGCCGCCAGCTGCCTGCTTCTCATTGCCCTGTGG 

GCCCAGGAGGCAAATGCGCTGCCCATCAACACCCGGTGCAAGCTTGAGGTGTCCAACTTCCAGCAG 

CCGTACATCGTCAACCGCACCTTTATGCTGGCCAAGGAGGTACAGCTGCATCTCTTTCTCTCCATAC 

CGCCTTGCCATTTCTCTGAAGCACTTGCAAACTCTTTAGGGGCGCTTTATCTCCGCAGGTCTCACTA 

CCTATGTTTTCTGTCTCTTTAGAGACTCT7TAAGGACTGGATCTTTTTCTATTTCT 

AGGACCATTTCCTATCTTGGCCTTCAGGACACATATACTGAATTTTATCTACAGAGGCGCGTTTAGAA 

AGCCACCCACGACTGCAATACTTTCCATCCTGTTGTGCTCTCTTCTGAACTCATACTCTCTTGGCTAC 

TCCTGAGA CCCA CTGCGGACATACATCTCTACTTACAGGCTTTTCTTCCATCTCCTTGTCACCCAGGC 

ACTTAGGGTTTTCTCTCTTTCAGGCCAGCCTTGCAGATAACAACACAGACGTCCGGCTCATCGGGGA 
GAA 

ACTGTTCCGAGGAGTCAGTGTAAGTCCTCACTGTGATGAGCAGGGCTAGCTGCGGGAGCTGGTGGA 

CCCTCTGGGATAGTCTGACGTATGACCCCTGCTGCTTCTTGTCTACCTGCAGGCTAAGGATCAGTGC 

TACCTGATGAAGCAGGTGCTCAACTTCACCCTGGAAGACATTCTGCTCCCCCAGTCAGACAGGTTCC 

GGCCCTACATGCAGGAGGTGGTGCCTTTCCTGACCAAACTCAGCAATCAGCTCAGCTCCTGTGTAA 

GTCTGGCTCTGGCTACCTATGCTCCTCTCTCTTCCTCTTCTATTCCAGTAAGAACCCGAGGTCCTGC 

CCTCTCTCTCTTCACAAGAGTGAGGAGGGCCTCAGCACCACCACCATCATAGGCCACTTGAAATAG 

GTCACAAAGGCTTTGGCTTCAATTGAGTAATACTTTGAGT 

CCATGGAAAGAAATCMCTCAAATTCTGTAGGATGAGAAAGATGTTGGGAACGAAAAAAGGCCTAGA 

TAGAGAAACAGATCTGCTGAGTACAGTACTTATGGGGGGGGGGGGCAGGGGGCGATATCCACTGA 

GTCCAAGTACTTGTTGGGAGAGAAATCCACTGAGTACAAGTACTTGTGGGGGAAGGAATGGCACAG 

AGCAAAAGTTGAAGGGAAAGAGGAAGATGGAGAGGCCTCAATGTTGGGGGTGTGAAAGGTCACTCC 

TTTTTCCATGTGATGGAGAGTTAAGAAAAATCAGTGTGTGAGTTTGATGTCTTCAGACACCCCAACTA 

TGGCAGACTGTGGGAGACCTGGCATTTAGGGMGGCGCGGCTTTTCACACGAGAAAC7TTATGCTC 

ATCTCTTGTGCTACACTCCCACCTTTGATGAGGTTMGCTCAGGTTTCGTTTCTACCGTTCTTGCTAC 

TGGTGGAAACTTCAGTAGGATTCCCCAAAGACGAGGACAGCTCTTCTGTAAGGGAGGGACCTGGAT 

TTCAGTGTCCTAGAGAACGAAA 

TAGCTCAGAGAATCTAGGTCAACGTGAAATCTAGGTCACAGCGGGCAAAAATGACTGAACGCCTCTA 

TTCCAGGTGAACGGTCACGTGCCTCAGATATACTGAGGTATTGGGCTCCCACCGGATAAGATTCTGT 

TAGTGAGTCTGCTTTTATTTTGCAGCACATCAGTGGTGACGACCAGAACATCCAGAAGAATGTCAGA 

AGGCTGAAGGAGACAGTGAAAAAGGTACTATTGGCAAGCCACAATACTAAGCCATTCAGTAGGAGA 

CGTGGGGATTTCTTTCTCTGCTTCCCAGTCTCTTCTACTTTGTAACATTTTCTTTGACT^ 

CTGGTCCATTACTCACTTAGCTGCACCTGCATCTAGCTGGGTCTATAGATCTTTCAATCTGTGTCTAA 

ATTTGTAAGTCACAATTCTGGAGCTAGCAGAAAGCTTAGCTCAGCCAGTCTCATGAGCACTTGCTCG 

GAGGATGGCTTGTGACAGAGTCAATGCTAGAAGACAGCATCCCTGATTCCCAGCTCTGCACTTGCCT 

AGTGGCCACGTGTAATTACTTTAGCCTGATTAAGTATTTGGGAAAGCCAATTCCCACCGACCTACATA 

ATCCGAAGAAGCATGCATTGAAAACTAGAMGCTGGGCACAAACTTACTAGAGATGATTTTTGAGCT 

CATTAAACTGATGCTCTGAMTGTGATCAMTCAACCCAGMTMCMCAAAAGAGCTGGATTTGCAA 

ATAGGACAAGTATTTAGMTCACTGJGTATTMCAGCTGTCATCTTMTTAAAATATAGTG 

CTGCCTATTTAAGATTAAACACAAGAGTGGATAACTTCCCMTTTACTGGGCCTGGTTTCAATAGAGT 

AAAAATATCAGTCATAGATTAATTATAGTGTCATGAAAGTATGAGTTGGAAACCC^ 

ACCTTC ATTTCTTAGTTATTA I i I 1 I I I I ICTTCACACCCTGATCAAGCCACTAGTAAGCACCTATCTG 

CTGCGA 

GCTATTATATGACTTTACAGCAAACAACATTGCTGTGTGGCCTCTTTGGGGAAGGGAACAGGATAGC 
AGGAGGCTCAGGCTAGCAAGTCTGGACTCAACCTAAAGCCAGAGGCATGGTTGATAGCAGAGAAAG 
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TGAGGCTCTTCACAAGTGGGTGTGCTTAAGTAATCAGAAACAGGAAGGCTCTGGTTGATGGAATTAT 

CAGTAAGATATCTACCCTTATCTCCTTCTTCTATAGAAGCTAAACCGTCTCTCCTTCTTGTGTGTAGG 

CTGATAAACACGCTTGTTTTCTTTTGAGTGTTCATGGCTTTGCAGATTTTCAGTGC 

GTTAGAGGGTTTGTTACCTTGACACCTGGGCTTGGATGTTAGCATGCCAAAGGCACACACTTCTGAA 

TGCCTGTGTAAAAGGTTATTATTCATTTAC TTTGTCTTT GGAAAGGTGAAGTGTGTGTGAGAAAGM 

TCACAGGAGATGTATTCTCTGTAGGAAAAC I I I I I I I ICCCCTTAAAAGCCTATAATCCACTTTCAGTC 

AACTTTGACTTTTATACCATGCTGTCACATGAAAGAGTGTTTAGGCCCGCTCTCGTGGCTCTGGGAA 

AAGCACCAATAGGGGAAGAAATGTTATGCCGAGAAATCTGACTGGCAGGGAAACTGGGTCAGAGCT 

CCCCAAAGACCACTACAGGTGTTAAGTAGGAACAGTCGAGGGTGGGTTCATATAATAGAATGGAACA 

GAGGGAGGGAAGATAAGCTACAAAGTTTCATAGGGTCCTAAGTCTTTAAGATACAAAATAGCTGGTT 

GGGCTTCATAACAAAGGAAGTCTGGGAAGGCAGCAAGCATTGAGAGGGAGATGGAAAGGGAAAAAA 

CMTGTAGAGGATTTGAAAAGCTACAAATCCTCCACGAGAGGATTTTTCTTGGAGGAATCTAGAACA 

AGGGTGGTGGATTAGGTGGATCGCAGAAGGACTTGCTTTGCCATTTGAATCTGGGTTTTTGTCTCTC 

CATTGAGGTTGAGA 

GCGTCACCCTTTTTTACCCTGGATAGGAGGAGGAMGMGGGGTGTTTTGACTCCTACCTGGAGT7T 

TACTAGTTTACGCAATGGMCAGA CACT CGGGACCTCCTCTTGACAAGAAAAAAAAAAAAAAAAGGA 

MCCTGTTGTTTCTCTTGmGTTCTTTTGTTAAGAMGCACAGGCAGCTGGGCATGGTC 

CCTTTAATCCCAGCATTTGGGAGGCAGAGGCAGGTGACTTTCTAAATTCAAGGCCAGCCTGGTCTAC 

AAAGTGAGTTCCAGGACAGCCAGGGCTATACAGAGAAACCCTGTCTCGGGAAAAAAAAAAAAGAAG 

AAAAGAAAAGAAAAGAAGAGAAGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGAGGAGA 

GGAGAGGAGAGGAGAGGAGAGGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGMGAGAAGAG 

AAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAGAGAAAAGAAAAG 

AGAAAAGAAAAGAAAAAAGCAAGCAAGCAAGCACTGGCAAAGCATGCCCACATGGGACGTATGTGG 

GTCTTTGAGACAAGGC7TTTGMTTGAGCGCTCATCAATAGTTGATCATGGTCAGGTGGAGGGCTAC 

CTGTCAGGCCGAGCCCTGCTGGCTTAGCACTTAACATCTCCAGGTCTCAGTATCACTTCCTGCTGCT 

TAGCACAGTTAGGAGTTGAGCAAACCTTTTTTTCCMCCCCCACTAAAATTTM 

TGTAATTTGTGGGATACAGTGTGATAATTGATCTATGTGTGCATTGTGCAAGGTTCAATAAGGTAGAT 

CAATAGGCCCATCAACAGCTTTATGGGTGTGAAATGCAAGTAATATAGGTAGATGCCTGTGTGTCCT 

TAGGTCAGAAAGGCATGATTTTAAGGTCTTGGGCAAATCATATTATACTCATGTTAAAMTGCATTA 

GTTGATTATCAATCTTTTAG AGAA 

GGCTGATACTTGGTTTTGGTGCTCAGCAAGCAAATGTCACCAGCTCTTTCTAACTAGTACCACTTTAG 

AAAATGCTACCCGTGCTCAAATTGGTTTGTATTCTTATTTTCATAGCTTGGAGAGAGCGGAGAGATCA 

AAGCGATCGGGGAACTGGACCTGCTGTTTATGTCTCTGAGAAATGCTTGCGTCTGAGCGAGAAGAA 

G CTAG AAAAC G AAG AACTG CTCCTTCCTG CCTTCTAAAAAG AACAATAAG ATCC CTG AATG G ACTTTT 

TTACTAAAGGAAAGTGAGAAGCTAACGTCCACCATCATTAGAAGATTTCACATGAAACCTGGCTCAG 

TTGAAAGAGAAAATAGTGTCAAGTTGTCCATGAGACCAGAGGTAGACTTGATAACCACAAAGATTCA 

TTGACAATATTTTATTGTCATTGATAATGCAACAGAAAAAGTATGTAC 

AGGTTACCTCTCATTCCTCTAGAAG AAAA GCCTATGTMCTTCATTTCCATAACCMTACTTTATATAT 
GTAAGTTTATTTATTATMGTATACATTTTATTT^^ 

TTATCTGATGTTGATATTTGAGTATAAAGCAAATAATATTTATGAT AATAA CTATAGAAACAAGATATCT 
TAGGCTTTAATAMCACATGMTATCATAAATCTTCTC 

TACCATCATCGTCATCATTACCCAATCATTCTCATGACTTCATGCTTGACTCATATTATCTGGTAAAGT 
TTG 



exon 1: nucleotides 355-584 
exon2: 976-1041 
exon 3: 1 141-1284 

exon 4: 2204-2269 
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